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Abstract 

Starting from a protein solution which is metastable with respect to the crystalhne phase, the effect of adding 
semidilute non-adsorbing polymer is considered. It is found to increase the chemical potential of the protein by a 
few tenths of kT, which may be enough to lower the barrier to nucleation of the crystalline phase by enough to 
allow crystallisation. It is also shown that assuming that the polymer induces a pairwise additive attraction leads to 
qualitatively incorrect results. 



1 Introduction 

Crystallising proteins from dilute solution is in general 
rather difficult, yet the crystalline form is required in order 
to perform X-ray crystallography.^'^ X-ray crystallogra- 
phy is currently the best method of determining the all- 
important structure of a protein in its native state. This 
has lead to considerable interest in how proteins crystallise; 
see Ref. 3 for a recent review from a colloidal physics per- 
spective and Refs. 4-8 for recent work. In a dilute solution 
there may be a large free energy barrier to nucleation of 
the crystalline phase,^'^'^ which prevents nucleation on ac- 
cessible time scales. Here, we consider a generic model 
protein which in the absence of polymer already has coex- 
istence between a dilute solution and a crystalline phase. 
We determine how the barrier to homogeneous nucleation 
of the crystalline phase changes when non-adsorbing poly- 
mer coils which are larger than the protein molecules are 
added to the protein solution. We find that the polymer 
lowers the nucleation barrier. This is to be expected as 
it is well-known that non-adsorbing polymer induces an 
effective attraction between the protein molecules, and an 
attraction will favour the dense crystalline phase. However, 
what is much less common, is that for large polymer coils it 
is essential to account for the fact that the polymer-induced 
attraction is strongly non-pairwise additive. A naive cal- 
culation using in the crystalline phase, the effective pair 
potential in dilute phase of protein molecules, greatly over- 
estimates the effect of the polymer. 

Protein molecules are typically a few nms across and the 
root-mean-square end-to-end distance of a flexible polymer 
can easily be several times this. The protein-protein in- 
teraction consists of a repulsive core of diameter D and an 
attraction which is sufficiently strong to cause the equilib- 



rium behaviour to be coexistence between a dense crystal 
and a dilute fluid phase. We do not need to specify the 
potential explicitly as we will require only the fluid-crystal 
interfacial tension 70 and the difference in chemical poten- 
tial between that of the crystal and that of the fluid, A/iQ. 
We use the subscript to indicate quantities before poly- 
mer is added. Our results should therefore apply to a very 
wide range of proteins despite potentially very large varia- 
tions in the structure and interactions of different proteins.^ 
The polymer is non-adsorbing: the interaction between the 
protein molecule and a monomer is purely repulsive. 

There has been extensive work done on systems of small 
colloidal particles immersed in polymer solutions of larger 
polymer coils which do not adsorb on the particles. Scal- 
ing theory for these systems was pioneered by de Gennes,^^ 
see also Refs. 12-15. Field-theoretic methods have yielded 
results which are exact for small hard particles and ideal 
polymers, as well as approximate results for polymers in 
good solvents. Work has also been done using inte- 
gral equations, see Ref. 19 and references therein. It is 
well understood that non-adsorbing polymer increases the 
chemical potential of particles and induces an attraction 
between these particles which is not pairwise additive: the 
interaction of three close-by protein molecules is not the 
sum of three pair potentials. "'^^'^""^^ However, most of this 
work (exceptions are Refs. 20, 21) only considered a few 
particles, two or three, whereas a nucleus of a crystalline 
phase contains 10s of tightly-packed particles. Because of 
this, the deviations from pairwise additivity observed here 
are much stronger than found in previous work. 

The next section briefly outlines classical nucleation 
theory for the solution before polymer is added. Section 
3 then describes the effect of polymer and section 4 is a 
conclusion. 



2 Classical nucleation theory 

The interaction between protein molecules is taken to in- 
clude a steeply repulsive core and an attraction. This at- 
traction may be isotropic or highly anisotropic or a mixture 
of the two, if isotropic it may or may not be short-ranged; 
all we require is that it render a dilute solution of the pro- 
tein metastable with respect to a crystalline phase of the 
protein. By metastable we mean that the fluid phase is 
not the equilibrium phase, it does not correspond to the 
absolute minimum in the appropriate free energy, but the 
fluid phase is dynamically stable with respect to crystalli- 
sation for times much larger than the characteristic time of 
the solution. See Ref. 23 for a general introduction to the 
properties and behaviour of metastable fluids. 

Classical nucleation theory^'^ assumes that the nu- 
cleus of the crystalline phase has a free energy AF which 
is the sum of a bulk term and surface term. See the book 
of Debencdctti^'' for an excellent introduction to classical 
nucleation theory. The protein molecules are modeled by 
spherical particles of diameter D. The bulk term is equal 
to the number of molecules in the nucleus, n, times the 
chemical potential difference A/x = — l-t, where fj,x is 
the chemical potential of the crystalline phase and /x is the 
chemical potential of the fluid phase which contains the nu- 
cleus. The fluid phase we are considering is not the true 
equilibrium phase, nucleation occurs from a fluid which is 
at a higher chemical potential than the crystal. Thus A/x 
is negative. The surface term is the surface area of the 
nucleus times the surface tension 7 of the bulk interface 
between the coexisting crystalline and fluid phases. The 
surface area of the nucleus is obtained by assuming that 
the nucleus is a sphere of radius R which is related to n by 
assuming that the density of spheres within the nucleus is 
equal to the bulk density of the crystalline phase, taken to 
be D~^: the density of a close-packed simple cubic lattice. 
Then n = {A/'i)-KR^D-^ . Putting this all together. 



AF = -ttR^D-^Ah + AttR'^j 



(1) 



The flrst term in AF is the bulk term, which is negative 
and decreases linearly with n, and the second term is the 
surface term which is positive and increases as n^^^. Thus 
AF passes through a maximum, denoted by AF* , at some 
value of R, denoted by R* . This maximum is the top of the 
free energy barrier to nucleation. The nucleus with radius 
R* and n* spheres is called the critical nucleus. Taking the 
derivative of Eq. (1) with respect to R and equating to 
zero. 



R* 



|Am| 



and inserting this value of n in Eq. (1) 



AF* = 



3A/i2 • 



(2) 



(3) 



where k and T are Boltzmann's constant and the temper- 
ature T, respectively. To increase the rate of nucleation of 
the crystalline phase of the protein we need to reduce the 
barrier AF*. In the absence of polymer AF* and R* are 
obtained from Eqs. (2) and (3) by setting A/U = A/xq and 
7 = 70- 

3 Nucleation in the presence of 
semidilute polymer 

Now we consider a solution of both protein molecules and 
flexible polymer molecules. The interaction between a pro- 
tein molecule and a monomer is taken to be strictly re- 
pulsive. Figure 1 is a schematic of the nucleus in a so- 
lution of polymer. We remark that this simple model is 
not adequate for all polymers, for example it is not ade- 
quate for PEG (poly(ethylene glycol)). We consider the 
case where the polymer molecules are larger than the pro- 
tein molecules and the polymer solution is semidilute. As 
the protein molecules are only a few nms in diameter this 
regime is easily accessible in experiment. In the other limit, 
where the polymer's radius is smaller than that of the col- 
loidal particles, the polymer induces a short-ranged deple- 
tion attraction between the particles. Such attractions are 
to a good approximation pairwise additive^*^ and therefore 
straightforward to treat. See Refs. 22,26-29 for work where 
the polymer is at most as large as the colloid. 

See the book of de Gennes^° for an introduction to 
semidilute polymer solutions. They are characterised by 
a correlation length ^ which scales as c^^^ , where cm is 
the monomer concentration. The osmotic pressure exerted 
by the polymer 11 ~ kT(,~^. 

The crystalline phase of the protein is dense, the gaps 
between protein molecules available to polymer arc around 
QAD or less across. Forcing a polymer molecule into a tube 
of diameter QAD costs roughly kT in free energy per 0.4F' 
of the tube's length as the polymer collides with the walls 
of the tube at intervals of roughly its diameter. Thus, the 
free energy density inside such a crystal, either a bulk crys- 
tal or a crystalline nucleus, will be of order kT per Q.QQD^ 
volume whereas in solution it is kT per volume. For ^ 
larger than D, which we assume throughout, the free en- 
ergy density inside a crystal is so much higher than outside 
that to a good approximation there is no polymer inside 
the crystal. This agrees with calculations using the theory 
of Ref. 20. So, as the crystalline nucleus is impermeable to 
polymer it interacts with the polymer, as a hard particle of 
radius R. 

The interaction w of a hard spherical particle of diam- 
eter D with a semidilute solution of polymer in a good 
solvent with a correlation length ^ is^^~^^ 



vjr^kT 



(4) 



Nucleation is an activated process with AF* as the barrier 
height and so the nucleation rate varies as exp(— AF*/fcT), 



where we took the exponent v to have its Flory value of 
3/5.^° w is the work done (= the difference in excess chem- 
ical potential) in taking a particle from a pure solvent and 



inserting it into the polymer solution. This assumes that 

the interaction between a protein and the monomers of a 
polymer is purely repulsive and that there is not much more 
than one protein molecule per volume. The notation ~ 
indicates that we have neglected a coefficient of order unity; 
here we derive only the scaling behaviour of the interactions 
with respect to the relevant length scales. Equation (4) can 
be derived in a couple of ways, see Refs. 11-14. One way is 
to note that when <C ^ the interaction between a particle 
and polymer must scale as the density of monomers, which 
scales as ^"''^^ when ly = 3/5. As w/kT is dimensionless, 
we require that appears as the ratio {D/S^Y^^ which 

gives Eq. (4). 

Straightaway we can derive an approximation for the 
contribution of polymer to the difference in excess chemi- 
cal potential A/x between the dense crystalline phase of the 
protein, and the semidilute polymer solution. It is 



4/3 



(5) 



where the second term on the right hand side is the change 
in free energy of the polymer when a single protein molecule 
is removed from the semidilute polymer solution and the 
last term is the change in free energy of this solution when 
it is compressed by the crystalline phase of the protein ex- 
panding by the volume of one protein. The osmotic pres- 
sure of a polymer solution 11 is related to ^ by II ~ kT^~^. 
Thus, the last term is equal to kT{D/^Y s-nd so is smaller 
than the second term, indeed it is of order terms we have 
dropped and so we drop it and obtain our final expression 
for A/x in the presence of polymer. 



A/x ~ A/xo 



(6) 



For a large nucleus, radius R ^ ^, the width of the 
nucleus-polymer-solution interface. ^. is small and so the 
capillarity approximation which underlies classical nucle- 
ation theory still holds: AF is still a sum of bulk and in- 
terfacial terms. The bulk term is given by the A/i of Eq. 
(6) times the number of protein molecules in the nucleus. 
In the presence of semidilute polymer the the interfacial 
tension of a flat interface, 7 is,^^ 



7- 



kT 

e 



To + 72-' 



(7) 



where 70 is the interfacial tension in the absence of poly- 
mer. Bearing in mind that 70 will be at least of order 
kT/D^ we see that the fractional modification to 70 due to 
polymer is of order (£)/^)^. For a A^o of order kT, the 
fractional change to A/i due to polymer is, see Eq. (6), of 
order (D/^)^^^. Within our crude scaling theory we only 
consider leading order contributions and so neglect the con- 
tribution of the polymer to 7, leaving the only contribution 
of polymer as the change in A/i given by Eq. (6). 

In the opposite limit for ii, <C ^, then the contri- 
bution of polymer to the free energy of the nucleus is no 



longer the sum of bulk and interfacial terms, i.e., it is not 

the sum of terms scaling as and as R^. For i? <C ^ the 
contribution of the polymer to the free energy of formation 
of the nucleus AF* is the sum of two terms: the first is the 
number of protein molecules in the nucleus times minus the 
increase in chemical potential in solution due to polymer, 
Eq. (4), and the second is the free energy cost of inserting 
the nucleus into the polymer solution. This second term is 
the free energy cost due to the nucleus excluding polymer 
from a sphere of radius R. Because the polymer cannot 
penetrate into the nucleus it interacts with the polymer as 
a single particle, and so this free energy cost is just Eq. (4) 
with D replaced by 2R. Adding the two terms together we 
have the contribution of polymer to AF, 



'r)\4/3 /OP\4/3 

AF = AFo-nkT{jj +kT(^] 



i?<e, (8) 



where AFq is the free energy of formation of the nucleus be- 
fore polymer is added. For R a few times D and R,D<^$, 
the second term in Eq. (8) dominates the last term. So, 
we neglect the last term in Eq. (8). Then as the remaining 
term is linear in n it is in effect a shift in A/i and indeed is 
the same shift in A/z we found in the opposite limit, R^ ^. 
Thus we conclude that as for i? 3> ^ the leading order con- 
tribution of semidilute polymer to the nucleation barrier is 
simply to shift A/z by the amount given in Eq. (6). 

So, we have shown that in both limits, i? ^ ^ and 
i? <C ^, the effect of adding polymer in the semidilute 
regime is to shift the chemical potential difference A/i by an 
amount of order — (Z)/^)'*/^fcr, which will be a few tenths 
of kT. Having shown that it holds in both limits we assume 
that it also holds for R/^ = 0(1) as well. The shift varies 
as ^""'/^ and so increases linearly with the density of poly- 
mer. The size of the critical nucleus also decreases, Eq. (2). 
Thus, if decreasing A/i by a few tenths of kT is enough to 
reduce the nucleation barrier sufficiently to make the nu- 
cleation rate significant then adding semidilute polymer is 
a viable way of inducing nucleation of the crystalline phase 
of small colloidal particles such as proteins and micelles. 

3.1 Comparison with assumption of pair- 
wise additivity 

A pair of particles in a semidilute polymer solution sep- 
arated by a distance x of order ^ or less feel a polymer- 
induced attraction towards each other, which we denote by 
For small particles, Z) <C ^, and for separations 



1^2- 



14,17,18 



r much smaller than the correlation length^^' ^'^ 



W2 



-(f)"'(7)"' 



(9) 



Naively it might be thought that the contribution of poly- 
mer to the free energy change A/i could be estimated using 
this pair attraction plus a mean-field approximation, which 
would correspond to the approximation 



A/i ~ A/io + D~ 



dvw2 {r) wrong. 



(10) 



r>D 



which is just the integral over the number density, in 

the crystalline phase, times the potential. As usual with 
a mean-field approximation, correlations in the positions 
of particles are neglected and one protein is considered to 
interact with surrounding particles which are at the mean 
density. The above integral is times the contribu- 

tion of polymer to the second virial coefficient. See Eisen- 
riegler^* for a more accurate treatment of the contribution 
of polymer to the second virial coefficient. So, inserting Eq. 
(9), into Eq. (10), we obtain 

A/i - A;Uo ~ -kT (^-|^ wrong. (11) 

If we compare Eqs. (6) and (11) we see that the assump- 
tion of a pairwise additive potential plus the mean-field 
approximation predicts the wrong scaling with (-D/^) of 
the polymer contribution to Aji. It greatly overestimates, 
by a factor of order {^/D)^/^, the effect of adding poly- 
mer whose correlation length is large with respect to the 
protein diameter. Naively using the pair potential between 
an isolated pair of particles, (a pair of particles with no 
others within a distance ^ of them) , as the pair interaction 
within a dense, number density » (here the crystal), 
is qualitatively wrong. 

4 Conclusion 

We have estimated the change in the barrier to nucleation 
when polymer is added to a metastable dilute protein so- 
lution. The polymer is semidilute, and has a correlation 
length ^ larger than the diameter of the protein D; the 
polymer does not adsorb onto the surface of the protein. 
When polymer is added, the barrier AF* is reduced due 
to the increase in the chemical potential of the protein in 
solution, possibly by enough to allow nucleation. However, 
the effect is not large, A/x decreases by only a few tenths 
of kT. Also, a naive use of the polymer induced attraction 
between a pair of proteins yields results which arc qualita- 
tively wrong: they grossly overestimate the effect of adding 
polymer. This does not mean that an effective potential 
approach to the effect of polymer cannot reliably estimate 
the effect of large polymer molecules, just that this effec- 
tive potential is many-body in nature and cannot simply be 
approximated by just a pair potential. This conclusion also 
applies to the phase behaviour of a system of hard-sphere- 
like colloidal particles and larger polymer molecules. 

Finally, we note that the second virial coefficient B2 is 
often used to measure the strength of the attractions in a 
protein solution. B2 is given by 

B2 = B2,o + 11 dr^^^ W2 « kT (12) 

^ Jr>D 

~ B2,^-D^'^e'\ (13) 

where i?2,o is the second virial coefficient in the absence 
of polymer and we have assumed not only that 11)2 /kT is 
small but that ^2,0 is dominated by interactions with a 



range <C ^: then the second virial coefficient is the sum 
of the two terms in Eq. (12). So as the polymer density 
increases, ^ decreases and the contribution of polymer to 
B2 decreases. But from Eq. (6) we see that as the poly- 
mer density increases it has a larger and larger effect on 
the nucleation barrier. So, the effect of polymer on B2 is 
anii-correlated with its effect on AF*, i.e., when one goes 
up the other goes down, not correlated as we might naively 
have expected. 
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